Interpreting Blackbox Models via Model Extraction
نویسندگان
چکیده
ABSTRACT The ability to interpret machine learning models has become incredibly important as machine learning is increasingly used to inform consequential decisions. We propose an approach to interpreting complex, blackbox models by constructing global explanations that summarize their reasoning process. In our approach, a global explanation is a decision tree that approximates the blackbox model. As long as the decision tree is a good approximation, then the reasoning process of the decision tree mirrors that of the blackbox model. We devise a novel algorithm for extracting decision tree explanations that actively samples new training points to avoid overfitting. We evaluate our algorithm on a random forest to predict diabetes risk, and a learned control policy for the cart-pole problem. Compared to several baselines, the decision trees extracted by our algorithm are substantially more accurate and are equally or more interpretable based on a user study. Finally, we describe several insights we derived based on our interpretations, including a causal issue that we validated with a physician.
منابع مشابه
Interpretability via Model Extraction
e ability to interpret machine learning models has become increasingly important now that machine learning is used to inform consequential decisions. We propose an approach called model extraction for interpreting complex, blackbox models. Our approach approximates the complex model using a much more interpretable model; as long as the approximation quality is good, then statistical properties...
متن کاملLong-term Iran's inflation analysis using varying coefficient model
Varying coefficient Models are among the most important tools for discovering the dynamic patterns when a fixed pattern does not fit adequately well on the data, due to existing diverse temporal or local patterns. These models are natural extensions of classical parametric models that have achieved great popularity in data analysis with good interpretability.The high flexibility and interpretab...
متن کاملBayesian Analysis of Spatial Probit Models in Wheat Waste Management Adoption
The purpose of this study was to identify factors influencing the adoption of wheat waste management by wheat farmers. The method used in this study using the spatial Probit models and Bayesian model was used to estimate the model. MATLAB software was used in this study. The data of 220 wheat farmers in Khouzestan Province based on random sampling were collected in winter 2016. To calculate Bay...
متن کاملTowards blackbox identity testing of log-variate circuits
Derandomization of blackbox identity testing reduces to extremely special circuit models. After a line of work, it is known that focusing on circuits with constant-depth and constantly many variables is enough (Agrawal,Ghosh,Saxena, STOC’18) to get to general hitting-sets and circuit lower bounds. This inspires us to study circuits with few variables, eg. logarithm in the size s. We give the fi...
متن کاملExtraction Kinetics and Physicochemical Studies of Terminalia catappa L Kernel Oil Utilization Potential
Kinetics and selected variables (temperature, particle size and time) for extraction of Terminalia Catappa L Kernel Oil (TCKO) were investigated using solvent extraction. Kinetic models studied were: parabolic diffusion, power law, hyperbolic, Elovich and pseudo-second-order. In ascending order, the best-fitted models at the optimum temperature and oil yield were Elovich’s model, hyperbolic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1705.08504 شماره
صفحات -
تاریخ انتشار 2017